Document Set Redundancy Compression Method Using Template Differential

Authors

  • Ping Yu
  • You Yang
Abstract

Document image information systems are increasingly used in government, and the documents stored in such systems contain much redundant information. This makes compression methods based on page-to-page statistical features a significant research topic. Set Redundancy Compression (SRC) is a technique that reduces the total entropy of an image set by exploiting the similarity between image pages. Compression-based Template Differential (CTD) is an improved SRC method: the set of similar images is constructed from the document template, and coding performance is improved by adding the template image to the Min-Max Differential (MMD) coding/decoding model. It is proved theoretically that CTD's coding performance is higher than MMD's. Experiments demonstrate that both CTD and MMD increase the compression ratio of the image set, with CTD giving the larger increase.
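The min-max idea behind MMD can be illustrated with a small sketch. This is a simplified illustration and not the authors' implementation: it assumes each page is a flat list of grayscale values, stores an explicit per-pixel flag to say whether the difference was taken from the minimum or the maximum image, and omits the entropy coder that would follow. The function names `mmd_encode` and `mmd_decode` are hypothetical. The abstract does not specify how CTD incorporates the template image, so that step is not shown.

```python
def mmd_encode(images):
    """Encode a set of equally sized pages (flat lists of pixel ints).

    Simplified assumption: an explicit flag records which reference
    (min or max image) each difference was taken from.
    """
    # Per-pixel minimum and maximum across the whole set.
    min_img = [min(px) for px in zip(*images)]
    max_img = [max(px) for px in zip(*images)]
    encoded = []
    for img in images:
        flags, diffs = [], []
        for v, lo, hi in zip(img, min_img, max_img):
            if v - lo <= hi - v:
                flags.append(0)       # closer to (or tied with) the min image
                diffs.append(v - lo)
            else:
                flags.append(1)       # closer to the max image
                diffs.append(hi - v)
        encoded.append((flags, diffs))
    return min_img, max_img, encoded


def mmd_decode(min_img, max_img, encoded):
    """Invert mmd_encode and return the original pages."""
    images = []
    for flags, diffs in encoded:
        images.append([
            lo + d if f == 0 else hi - d
            for f, d, lo, hi in zip(flags, diffs, min_img, max_img)
        ])
    return images


# Three similar "pages" of four pixels each (made-up values).
pages = [[10, 200, 30, 255], [12, 198, 30, 0], [11, 201, 33, 255]]
min_img, max_img, enc = mmd_encode(pages)
restored = mmd_decode(min_img, max_img, enc)
```

When the pages are similar, the stored differences cluster near zero, which is what lowers the entropy of the set before a conventional lossless coder is applied.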


Similar resources

Structure-Preserving Document Image Compression (Proceedings of the International Conference on Image Processing, 1996)

Maintaining a document in image form is often preferable in order to avoid the high cost of manual conversion or the introduction of large numbers of errors by automatic OCR and/or graphics interpretation. The large volume of data in the image can be greatly reduced by using compression techniques. Text-intensive document images typically have a great deal of redundancy in the bitmap representa...



On the Effectiveness of using Sentence Compression Models for Query-Focused Multi-Document Summarization

This paper applies sentence compression models for the task of query-focused multi-document summarization in order to investigate if sentence compression improves the overall summarization performance. Both compression and summarization are considered as global optimization problems and solved using integer linear programming (ILP). Three different models are built depending on the order in whi...


Multi-Document Summarization By Sentence Extraction

This paper discusses a text extraction approach to multi-document summarization that builds on single-document summarization methods by using additional available information about the document set as a whole and the relationships between the documents. Multi-document summarization differs from single in that the issues of compression, speed, redundancy and passage selection are critical in ...


A Comparison of Set Redundancy Compression Techniques

Medical imaging applications produce large sets of similar images. Thus a compression technique is necessary to reduce space storage. Lossless compression methods are necessary in such critical applications. Set redundancy compression (SRC) methods exploit the interimage redundancy and achieve better results than individual image compression techniques when applied to sets of similar images. In...



Journal title:
  • JSW

Volume 7  Issue 

Pages  -

Publication date: 2012